Automatic Language Identification Using Inductive Inference
نویسنده
چکیده
Automatic spoken language identification (LID) plays an important part in routing foreign callers to operators who speak the caller's language, or as a front-end to a multi-lingual translation system to route the call to the appropriate translation system. A common approach to spoken language ID is adopted from current speaker independent recognition techniques. These generally involve the development of a phonetic recognisor for each language and then combining the acoustic likelihood scores to determine the highest scoring language. The models are trained using hidden Markov modelling (HMM) or neural networks (NN). This paper proposes a novel approach to spoken language identification by the use of inductive inference "decision trees". To develop the production rules, the classification models are generated inductively by examining a large speech database and then generalising the pattern from the specific examples. This approach has already been successfully used for isolated digit recognition (Samouelian, 1996). The aim of this research is to demonstrate that inductive learning can provide a viable alternative approach to existing automatic spoken language identification techniques. The proposed LID is based on automatic speech recognition (ASR) system using inductive inference (Samouelian, 1994a, 1994b). It uses a single decision tree to capture all the complexities of each language, using mel-scaled cepstral coefficients (MFCC) as input. The training database is labelled at the language level. The LID classification is performed at the frame level, using an inference engine to execute the decision tree and classify the firing of the rules. A simple sorting routine is then used to identify the spoken language. Spoken language identification results using the OGI Multi-language Telephone Speech Corpus (OGI-TS) on the 3 language task (English, German and Japanese), are presented.
منابع مشابه
Use of inductive learning for speech processing
This paper proposes the use of inductive inference "decision trees" for speech processin applications such as automatic speech recognition (ASR), automatic language identification (ID), speech understanding and speaker verification. The aim of this research is to demonstrate that Artificial Intelligene (AI) techniques such as inductive learning can provide an alternative approach to existing sp...
متن کاملTowards Automatic Learning of a Structure Ontology For Technical Articles
Despite the high level of success attained by keyword based information retrieval methods, a significant fraction of information retrieval tasks still needs to take into account the semantics of the data. We propose a method that combines an hand-crafted ontology with a robust inductive inference method to assign semantic labels to pieces of technical articles available on the Web. This approac...
متن کاملMind Change Speed-up for Learning Languages from Positive Data
Within the frameworks of learning in the limit of indexed classes of recursive languages from positive data and automatic learning in the limit of indexed classes of regular languages (with automatically computable sets of indices), we study the problem of minimizing the maximum number of mind changes FM(n) by a learner M on all languages with indices not exceeding n. For inductive inference of...
متن کاملCharacterizing Language Identification in Terms of Computable Numberings
Identification of programs for computable functions from their graphs and identification of grammars (r. e. indices) for recursively enumerable languages from positive data are two extensively studied problems in the recursion theoretic framework of inductive inference. In the context of function identification, Freivalds, Kinber, and Wiehagen have shown that only those collections of functions...
متن کاملConnected Digit Recognition Using Inductive Inference
This paper proposes a novel approach to connected digit recognition by the use of inductive inference "decision trees". To develop the production rules, the expert is bypassed and instead the classification models are generated inductively by examining a large speech database and then generalising the pattern from the specific examples. This approach has already been successfully used for isola...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1996